Continuous Distributed Counting for Non-monotonous Streams
نویسندگان
چکیده
We consider the continual count tracking problem in a distributed environment where the input is anaggregate stream originating from k distinct sites and the updates are allowed to be non-monotonous, i.e. both incre-ments and decrements are allowed. The goal is to continually track the count within a prescribed relative accuracyat the lowest possible communication cost. Specifically, we consider an adversarial setting where the input valuesare selected and assigned to sites by an adversary but the order is according to a random permutation or is a randomi.i.d process. The input stream of values is allowed to be non-monotonous with an unknown drift −1 ≤ μ ≤ 1 wherethe case μ = 1 corresponds to the special case of a monotonous stream of only non-negative updates. We show thata randomized algorithm guarantees to track the count accurately with high probability and has the expected commu-nication cost Õ(min{√k/(|μ| ),√kn/ , n}), for an input stream of length n, and establish matching lower bounds.This improves upon previously best known algorithm whose expected communication cost is Θ̃(min{√k/ , n}) thatapplies only to an important but more restrictive class of monotonous input streams, and our results are substantiallymore positive than the communication complexity of Ω(n) under fully adversarial input. We also show how our frame-work can also accommodate other types of random input streams, including fractional Brownian motion that has beenwidely used to model temporal long-range dependencies observed in many natural phenomena. Last but not least, weshow how our non-monotonous counter can be applied to track the second frequency moment and to a Bayesian linearregression problem.
منابع مشابه
Counting Arbitrary Subgraphs in Data Streams
We study the subgraph counting problem in data streams. We provide the first non-trivial estimator for approximately counting the number of occurrences of an arbitrary subgraph H of constant size in a (large) graph G. Our estimator works in the turnstile model, i.e., can handle both edge-insertions and edge-deletions, and is applicable in a distributed setting. Prior to this work, only for a fe...
متن کاملApproximate Counting of Cycles in Streams
We consider the subgraph counting problem in data streams and develop the first non-trivial algorithm for approximately counting cycles of an arbitrary but fixed size. Previous non-trivial algorithms could only approximate the number of occurrences of subgraphs of size up to six. Our algorithm is based on the idea of computing instances of complex-valued random variables over the given stream a...
متن کاملInvestigating the neural correlates of continuous speech computation with frequency-tagged neuroelectric responses
In order to learn an oral language, humans have to discover words from a continuous signal. Streams of artificial monotonous speech can be readily segmented based on the statistical analysis of the syllables' distribution. This parsing is considerably improved when acoustic cues, such as subliminal pauses, are added suggesting that a different mechanism is involved. Here we used a frequency-tag...
متن کاملCounting Dependence Predictors Undergraduate Honors Thesis
Modern processors rely on memory dependence prediction to execute load instructions as early as possible, speculating that they are not dependent on an earlier, unissued store. To date, the most sophisticated dependence predictors, such as Store Sets, have been tightly coupled to the fetch and execution streams, requiring global knowledge of the in-flight stream of stores to synchronize loads w...
متن کاملContinuous Distributed Monitoring A Short Survey
In the model of continuous distributed monitoring, a number of observers each see a stream of observations. Their goal is to work together to compute a function of the union of their observations. This can be as simple as counting the total number of observations, or more complex non-linear functions such as tracking the entropy of the induced distribution. Assuming that it is too costly to sim...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012